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Abstract. The absolute change in the Rayleigh quotient (RQ) is bounded in this paper in 
terms of the norm of the residual and the change in the vector. If x is an eigenvector of a self- 
adjoint bounded operator A in a Hilbert space, then the RQ of the vector x, denoted by p(x), is 
an exact eigenvalue of A. In this case, the absolute change of the RQ \p(x) — p(y)\ becomes the 
absolute error in an eigenvalue p(x) of A approximated by the RQ p(y) on a given vector y. There 
are three traditional kinds of bounds of the eigenvalue error: a priori bounds via the angle between 
vectors x and y\ a posteriori bounds via the norm of the residual Ay — p(y)y of vector y; mixed type 
bounds using both the angle and the norm of the residual. Wo propose a unifying approach to prove 
known bounds of the spectrum, analyze their sharpness, and derive new sharper bounds. The proof 
approach is based on novel RQ vector perturbation identities. 
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1. Introduction. Let A be a bounded self-adjoint operator in a real or complex 
Hilbert space T-L. For a nonzero vector x, the Rayleigh quotient (RQ) is defined by 

p(x)=p(x,A) = )Er™L 

[X, X) 

and the corresponding residual vector is denoted by r (x) = r(x, A) = Ax — p(x)x, 
where (-, •) is an inner product, associated with a norm by || • || 2 = (-,-). The acute 
angle between two nonzero vectors x and y is denoted by 

,< i \( x ,y)\ 

Z {x, y\ = arccos ■ 



\Ah\\ ' 

We are interested in RQ vector perturbation bounds. Specifically, for nonzero 
vectors x and y, we want to bound the following quantity, \p(y) — p(x)\, in terms of 
Z {x, y}, \\r(x)\\, and If x is an eigenvector of A, then the RQ of x is an exact 

eigenvalue of A and r(x) = 0. In this case, the absolute change of the RQ \p(y) — p(x)\ 
becomes the absolute error in the eigenvalue p(x) of A. The RQ is often used to 
approximate points of the spectrum of A. 

Known bounds of the spectrum are traditionally classified depending on the terms 
that appear in the bounds. Let A G be approximated by p{y), i.e., the absolute 

approximation error is \p(y) — A|. Bounds of \p(y) — A| that are based on the norm 
of the residual ||r(y)|| are called "a posteriori" bounds, since the residual r(y) and its 
norm can typically be computed for the given vector y. 

If A is an eigenvalue with the corresponding eigenvector x, bounds of \p(y) — A 
that rely on the angle Z {x, y} are called "a priori," since the eigenvector x is usually 
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not explicitly known, and some a priori information about it needs to be used to bound 
the angle Z {x, y}. In the context of the Finite Element Method error bounds, where 
x and y are functions, such information is usually associated with the smoothness of 
the function x; see, e.g., [7]. Finally, we call a bound "a mixed type," if it involves 
both terms, the residual norm ||r(y)|| and the angle Z {x, y}. 

Sharp bounds of the spectrum approximation error are very important in the 
theory of numerical solution of self-adjoint eigenvalue problems. Computable a poste- 
riori bounds allow one to obtain two-sided approximations to points of the spectrum. 
A priori and mixed bounds give one an opportunity to determine the quality of ap- 
proximation of a point of the spectrum by the RQ. 

We revisit this classical topic of research for two reasons. Our first goal is to 
carefully examine a posteriori and a priori bounds together, to understand what they 
have in common and why they are so different. Second, we are interested in discovering 
a general framework for deriving bounds of \p(y) — p{x)\ for arbitrary vectors x and 
y where known bounds of the spectrum approximation error become corollaries. 

We start, in section 2, by reviewing three known bounds of the spectrum ap- 
proximation error, each representing a priori, a posteriori, and mixed types, corre- 
spondingly. In section 3, we derive new identities and bounds for the change in the 
RQ with respect to the change of the vectors, using an orthogonal projector on the 
subspacc span{x,y}. The idea of such a "low-dimensional" analysis has been found 
fruitful before, e.g., in [3]. We observe that in the case dim(H) = 2 all three bounds, 
which we want to reinvent, turn into the same identity. In section 4, we derive the a 
priori and mixed type bounds from this identity, using a single inequality, different for 
each bound, only at the last step. This unifying proof technique allows us to easily 
specify the circumstances, where the bonds are sharp. We dedicate separate section 5 
to a posteriori bounds. Our "low-dimensional" analysis not only allows us to find a 
novel proof of a well known a posteriori bound, but also gives us an opportunity to 
improve it, and obtain several new sharper results. Our bonus section 6 touches the 
topic of improving known sin(2#) and tan(#) error bounds for eigenvectors. 

2. Short review of some known error bounds for eigenvalues. Let y be an 

approximation to an eigenvector x of the operator A, corresponding to the eigenvalue 
p{x) of A. An a priori bound involves a constant and the square of the sine of the 
angle between the eigenvector x and the vector y, see, e.g., [5, 6, 9], 

(2.1) |A-/>(y)| < (£ma X (A)-£ min (A))sin 2 (Z{a;,y}), 

where we denote A = p{x), and E max (^4) and S m i n (A) denote the largest and the 
smallest points of the spectrum of A, correspondingly. 

The mixed type bound shown in [4, 11] is in terms of the norm of the residual 
vector and the tangent of the angle between vectors x and y, i.e., 

(2.2) \ X - p (y)\<t^tzn(Z{x,y}). 

\\v\\ 

There is no constraint on the location of the eigenvalue p(x) relative to p{y) 
in (2.1) and (2.2). Bounds (2.1) and (2.2) hold for any nonzero vector y and any 
eigenvector x. The next bound we review requires having some information about 
the spectrum of A in the neighborhood of p(y). This a posteriori bound involves the 
square of the norm of the residual vector and the gap between p(y) and the rest of 
the spectrum of A. Let us choose real numbers a and /3 such that a < p (y) < j3 
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and S (A) n (a, (3) = 0, where £ (A) represents the spectrum of A. Then the Temple 
bound holds, see, e.g., [2, 13], 

(2-3) (P-p(y))(p(y)-a)<^^ . 

The Temple bound (2.3) turns into an equality if both a and /3 are eigenvalues of A 
and the vector y is a linear combination of the corresponding eigenvectors. 

Let now the only point of the spectrum £ (A) in the interval (a, b) be an eigenvalue 
A and let also p(y) £ (a, b), then 

(2.4) -^ l W<p(y)->< 1 l|r(y) " 2 



p(y)-a \\y\\* ~™ ~ b~p(y) \\y\\* ' 

This is Kato- Temple's inequality [2, 13], and it easily follows from (2.3). Indeed, in 
the case where A is located to the right of p(y) then we take a = a and j3 = A in (2.3) 
to get the lower bound of (2.4). In the opposite case, we take a = A and /3 = b to 
obtain the upper bound of (2.4). 

If we take a = p(y)—6 and b = p(y)+8 in (2.4), where S = min t;e {2(A)/A}|»?— p{y% 
then (2.4) directly implies, see, e.g., [8, 10, 12], 

1 \W 

(2-5) |A-p(y)|< 7 " 



<* llvl 
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Bound (2.5) turns into an equality if A — 5 or A + S is an eigenvalue of A and the 
vector y is a linear combination of the corresponding to A — 5 or A + 5 eigenvector and 
an eigenvector corresponding to A. 

Since among all points of the spectrum of A, the isolated point A £ T,(A) is the 
closest one to p(y), we evidently have from (2.5) that 



\X-p(y)\ 2 <\X-p(y)\S< 
which gives the Krylov-Weinstcin bound, sec, e.g., 

\X- P (y)\ < 



\ r (y) 



2 



lyll 2 



WW 

This derivation of the Krylov-Weinstein bound from bound (2.5), although trivial, is 
not apparently very well known. It demonstrates that the Krylov-Weinstein bound is 
simply a particular case of (2.5), where |A — p(y)\ < S. 

Our discussion above shows that bound (2.3) is the most fundamental a posteriori 
bound, since other a posteriori bounds, reviewed here, can be derived from it. The 
main goal of the rest of the paper is to revisit bounds (2.1), (2.2), and (2.3). Wc 
propose a new unifying approach to prove them, analyze their sharpness, and derive 
some new sharper bounds. 

3. Key identities for the RQ. We present several key identities for the abso- 
lute change in the RQ. 

Let A be a bounded self-adjoint operator on a real or complex Hilbert space H. Let 
S denote a subspace of H and Ps be an orthogonal projector on S. Let As = (PsA) \s 
denote the restriction of the operator PsA to the subspace S. For a nonzero vector 
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x £ S, we denote p(x,Ag) = {x, Agx) / {x,x) and r(x,Ag) = A$x — p(x,A$)x £ S. 
We start with a couple of trivial, but extremely important, for our approach, lemmas. 

Lemma 3.1. If x £ S then p(x, A s ) — p{x). 

Proof. The orthogonal projector P$ is self-adjoint, so 

/ , \ (x,P s Ax) (x,Ax) 

p(x, A S ) = — r— = r = p(x). 

[X, X) [X, x) 

□ 

Lemma 3.2. If x e S then r(x,A s ) = Psr(x). 

Proof. Directly by the definition and Lemma 3.1, we obtain 

r(x, A s ) = A s x - p(x, A s )x = P s Ax - p(x)x = P s (Ax - p(x)x) = P s r(x). 

□ 

In the rest of the paper, for given linearly independent vectors x and 
y in W we always define the subspace S as S = sp&n{x,y}. 

Corollary 3.3. // one of the vectors x or y is an eigenvector of A, it is also an 
eigenvector of As- 

Proof. Let x be an eigenvector of A, then r{x) = 0, so r(x,As) = Psr(x) = 0, 
which means that x is also an eigenvector of As ■ □ 

Remark 3.4. It is important to realize that the operator As acts in a two- 
dimensional space, thus many statements concerning As can be easily proved because 
of this. Here, we collect some basic identities, using the eigenvectors of As- 

Let us denote p = A max (As) = p{u\) and v = X m i n (As) = p{u2), where u\ and 
U2 are orthogonal eigenvectors of the operator As- Let Pi with i = 1, 2 denote the 
orthogonal projectors on the subspace span(u,). Assuming a nontrivial case PiX ^ 
for i = 1, 2, we evidently have Z {x 1 Ui] = Z {x, Pix}. Now, 

= (x,A s x) = (PjX, A s Pix) + (P 2 x, A s P 2 x) = p{P 1 x,P l x) +v(P 2 x,P 2 x) 
P[X) (x,x) (P 1 x,P 1 x) + (P 2 x,P 2 x) (Pix,P lX ) + (P 2 x,P 2 x) 

since x = Pyx + P 2 x and P\P 2 = P\AsP 2 = 0. Therefore, 

(3.1) p — p (x) = (p — v) sin 2 (Z {x, ui}) and p (x) — v = (p — v) sin 2 (Z {x, u 2 }) ■ 
Evidently, in a two-dimensional space, 

(3.2) cos (Z {x, ui}) = sin (Z {x, u 2 }) and cos (Z {x, u 2 }) = sin (Z {x, u±}) . 



3.1. Identities for the norm of the residual r (x) = Ax — p(x)x. Our main 
identity is in the following lemma. 
Lemma 3.5. 

(3.3) [A max (A S ) - p(x)} [p(x) - A min (A 3 )] = l|P ^^ )l12 . 

Iff 

Proof. Let us denote p = A max (As) and v = A m i n (^4s). By Lemmas 3.1 and 3.2, 
identity (3.3) can be equivalcntly rewritten as 

[p p(x, A s )] [p(x, A s ) -u]= Hx i ; A l s : )r . 
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Since (r(x,A s ),x) = 0, we have \\r(x, A s )\\ 2 = \\A s x\\ 2 - p 2 (x, A s )\\x\\ 2 and thus 

l|r( ^ 2 S) " 2 - ^ P( x > A s))(p(x, A s ) -u) = - (/* + u)p(x, A s ) + »u 

(Ago: — fix, Asx — vx) 

= M 2 
-0, 

where (As — fJ>) (As — v) = is the minimal polynomial of As , since dimS 1 = 2 and \x 

and v are the eigenvalues of As- □ 

Remark 3.6. Let x be an eigenvector of A in Lemma 3.5. Then one of the two 

eigenvalues of As is equal to p(x). 

Proof. The statement follows directly from Lemma 3.1 and Corollary 3.3. □ 

We also mention the following identity, relying on eigenvectors of As- 

Lemma 3.7. Let m with i = 1, 2 denote the eigenvectors of As- For both i = 1 

and 2, we have 

(3.4) 1 [A max (A s ) - A min (A s )] sin (2Z {x, m}) - 



1 ' 'max \ ji o / ' "mm \ J y 1 ^^^^ v 1 7 ""i 1 / 

2 

Proof. By Lemma 3.2, and using the notation of Remark 3.4, identity (3.4) can 
be equivalently rewritten as 

1. . . .„ . . \\r(x, A s )\\ 

- [ M - v] sin (2Z {x, m}) = " 11^11 • 

The statement of the lemma follows from Lemma 3.5 and the trigonometry identity 
for the sine of the double angle. □ 

3.2. Identities for the absolute change in the RQ. Here we prove our main 
tangent- and sine-based identities. 

Remark 3.8. Let Z{x,y} < tt/2. We note the following useful identities 

\\r(x)\\ cos (Z{r(x),y}) = \{r(x),y) = \(P s r(x),y)\ = \\P s r(x) \\ cos (Z {P s r(x), y}) 
\\x\\ cos(Z{x,y}) \(x,y)\ \(x,y}\ \\x\\ cos (Z {x, y}) 

and 

\\P s r(x)\\ cos (Z {P s r(x),y}) \\P s r (x) || 



cos (Z {x, y}) \\x\\ 



tan(Z{a;,y}) , 



where cos (Z {Psr(x), y}) = sin (Z \x, y}) . Indeed, = (r(x),x) = (Psr(x),x) , i.e., 
vectors Psr(x) 6 S and x € S are orthogonal in S, and also y E S, where dim5 = 2. 
Let us finally also give an alternative simple identity, 

I (r(x),y) I _ I (P y r(x),y) \ _ \\P y r(x)\\ cos (Z {P y r(x), y}) _ \\P y r(x)\\ 



\(x,y)\ \(x,y)\ \\x\\ cos (Z{x,y}) ||x||cos(Z{a;,y})' 

where P y is an orthogonal projector on the subspace span{j/} and Z {P y r(x),y} = 0. 
Theorem 3.9. Let Z{x, y} < n/2 and let us denote 



=■ 1 = 



I Pgr (a;) || ± \\P s r (y) 



\\y\ 



tan (Z {x, y}) 



G 
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We always have 

S- < \p(x)-p(y)\ < S+. 
Now, let us assume that (x,r(y)) / (x, y) or (r(x),y) / (x,y) is real. Then 

\p{x) - p{y)\ =S+ 
if (x,r(y)) j (r(x),y) < or (r(x),y) / (x,r(y)) < 0, otherwise, 

\p(x) -p(y)\ = 3_. 

Proof. By elementary calculations, we obtain 

(p(x) - p(y)) (x, y) = (p(x)x, y) - (x, p(y)y) + (x, Ay) - (Ax, y) 
= (x,r(y)) - (r(x),y) , 

thus 

(3.5) \p(*)-p(y)\= l{xM f- { ;l x) > y)l . 

I (x,y) 

In a complex space, the RQ remains real, since A is Hcrmitian, but the scalar products 
may not be real, so in general we have, by the triangle inequality, that 

|| (x,r(y)) | - | (r(x),y) || < | (x,r(y)} - (r{x),y) \ < \ (x,r(y)) \ + \ (r(x),y) |. 

Unless one of the ratios (x,r(y)) / (x,y) or (r(x),y) / (x,y) is real, the inequalities 
above are strict. Indeed, the difference of the ratios is equal to p(x) — p[y), i-e., is 
always real. Thus, one of the ratios is real if and only if one of the scalar products 
(x, r(y)} and (r(x), y) is a real scalar times the other. 

The assumption Z{x,y} < it/2 implies that at least one of the scalar products 
(x,r(y)) and (r(x),y) is non-zero. Let it be, e.g., (r(x),y). If (x,r(y)) / (r(x),y) > 
then the left inequality turns into an equality. If (x, r(y)) / (r(x), y) < then the right 
inequality turns into an equality. All other cases, possible in a complex space, lead to 
strict inequalities above. 

The statements of the theorem now follow directly from Remark 3.8. □ 

Lemma 3.10. For both i = 1 and 2, we have 

(3.6) \p(x) - p(y)\ = \n - v\ sin (Z {x, + Z {y, itj) |sin (Z {x, u t } - Z {y, Ui})\ . 

Proof. Identities (3.1) and (3.2) of Remark 3.4 imply 

\p{x) - p (y) | = [A* - v] |cos 2 (Z{x,Ui}) - cos 2 (Z{y,Ui\) \ , 

which leads directly to identity (3.6) using elementary trigonometry. □ 
Theorem 3.11. Let us for i = 1 or 2 denote 

*± = [A raax (A s ) - A min (A s )] | sin (Z {x, u t } ± Z {y, itj)| sin (Z {x, y}) , 

and let C = (x,u±) (u2,x) (u\,y) (y,U2). We always have 

*- < \p(x)- P (y)\ < *+. 
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If C is a complex quantity then both inequalities on the left and right are strict. If 
C > (C < 0) then we have equality for the lower (upper) bound. 

Proof. Identity (3.6) makes the statement of the theorem equivalent to 

(3.7) | sin (Z {x, U{} - Z {y, u t }) | < sin Z {x, y} < sin (Z {x, u t } + Z {y, w 4 }) . 

For simplicity of notation, let us assume, without loss of generality, that all vectors 
involved here are normalized, i.e., ||ui|| = 1 1 1*2 1 1 = ||x|| = \\y\\ = 1. Using representa- 
tions of the vectors x and y with respect to the orthonormal basis u\ and Ui of the 
subspace S, we obtain by direct computation 

sin Z {x, y} — cos 2 Z {x, ui} cos 2 Z {y, U2} + cos 2 Z {x, U2} cos 2 Z {y, ui} — 25R(C) 

and 

|C| = cos Z {x, Mi} cos Z {x, U2} cos Z {y, ui} cos Z {y, U2} 

Evidently, -|C| < R((7) < \C\, and if C > (C < 0) then »(C) = \C\ (5R(C) = -|C|), 
which proves bounds (3.7) and also gives conditions of their sharpness. □ 

Remark 3.12. An alternative proof of Theorem 3.11 is based on the fact that 
angles between subspaces describe a metric, see, e.g. [14] and references there, so 

(3.8) \Z {x, u}-Z {y, u}\<Z {x, y} < \Z {x, u} + Z {y, u}\ . 



Theorem 3.11 trivially implies the bound 

\p( x ) ~ P(y)\ < [Amax (-4s) - A min (A s )} sin (Z {x, y}) 

that can be found for the real case in [6]. 

Finally, we apply our results above to an important special case. 
Corollary 3.13. Let x be an eigenvector of A and p(x) — A. Then 

(3.9) |A - p(y)\ = [A max (A s ) - A min (A s )] sin 2 (Z {x, y}) . 
If, in addition Z{x,y} < tt/2, then 

(3.10) |A-p(y)l = l|P ^^ 11 t<m(Z{x,y}). 

\\v\\ 

If also n ^ A denotes an eigenvalue of As, then 

1 \\Psr(y)\\ 



(3.11) t&n(Z{x,y}) = 



v- P (y)\ \\v\\ 



Proof. Since x is an eigenvector of A it is also an eigenvector of As, with the 
same eigenvalue p(x) = A, by Corollary 3.3. The statements (3.9) and (3.10) follow 
directly from Theorems 3.9 and 3.11, correspondingly. Finally, identity (3.11) comes 
from (3.3) and (3.10). □ 
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4. Deriving some known eigenvalue error bounds from our identities. 

We now easily derive two known bounds, reviewed in section 2, from the results 
obtained in section 3. First, since S roin (A) < A min (A s ) < A max (As) < S max (^4), 
from (3.9) we obtain a priori bound (2.1), 



The inequality becomes an equality if E max (A) and S m ; n (A) are eigenvalues of As- 
For details, see reference [6]. 

The second known result, mixed bound (2.2), follows from (3.10), 



The inequality turns into an equality iff Psv(y) = r(y), i.e., iff Ay S span{x, y}, which 
is equivalent to the subspace span{x, y} being ^4-invariant. More detailed information 
about the quality of this bound is determined by Remark 3.8 where vectors x and y 
are swapped to give us the following identity 



Remark 4.1. Our assumption that the operator A is bounded is necessary in 
bound (2.1) and can be used in bound (2.2) to guarantee the existence of the vector 
Ay. Our results of section 3 formally speaking would hold for an unbounded operator 
A and given vectors x and y as soon as the operator As can be correctly defined by 
As = (PsA) |s ; where S = spanjx, y}. One could do even better than that, in fact, 
one only needs to be able to correctly define the values p(x) and p(y). 

Rather than getting into technical details, we illustrate such a possibility using 
the following example from Davis and Kahan [If. Let e = 1/2 and % = 1%. We 
take y = (1, e, e 2 , • • • , e™, • • • ) E Z 2 and A = diag(l, e _1 , e -2 , • • • , e~™, • • • ). We get 
p(y) = 1 + e even though the vector Ay = (1, 1, 1, • • • ) has an infinite norm in 12- 
Let us take x = (1,0, 0, •••), the eigenvector corresponding to the lowest eigenvalue 
p(x) = 1, and define w = (0, e, e 2 , e 3 , ■ ■ • ) with p(w) = (1 + e)/e. Then As can be 
constructed using its eigenvectors x and w and the corresponding eigenvalues p(x) and 
p(w) despite of the fact that Aw = (0, 1, 1, ■ • • ) does not exist in % = I2 

We have now demonstrated how two known bounds, a priori bound (2.1) and 
mixed bound (2.2), can be derived in a simple and uniform manner from essentially 
two dimensional identities (3.9) and (3.10). The last well known bounds, we review 
in section 2, the a posteriori Temple bound (2.3) and its follow-up (2.5), can also be 
derived in a similar way from their two dimensional prototype (3.3). However, the 
derivation is not so trivial, so we dedicate a separate section for it, coming next. 

5. New a posteriori bounds for eigenvalues. For convenience, let us remind 
the reader of the Temple bound (2.3) 



I A - p(y)\ = [A max (A s ) - A min (As)] sin 2 (Z {x, y}) 
< [E max (A) - £ min (A)] sin 2 (Z {x, y}) . 



|A-P(y)| 



< 




tan (Z {x, y}) 



t&n (Z{x,y}) 



||r(i/)||oos(Z{r(i/),a:}) = \\P s r(y)\\cos(Z{P s r(y),x}). 



08 - p(v)) (p(v) 



a) < 



\Hy)\? 
h\\ 2 
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where a < p(y) < /3 and E (A) n (a, f3) = 0, and its two dimensional analog (3.3) 
[A max (As) - p(v)\ Ip(u) A min (A s )} = llPs " jg> . 

Our primary goal is to derive (2.3) from (3.3). We first do it for an important partic- 
ular case, where we can even improve (2.3). 

Lemma 5.1. Let a = A mm (A) be an isolated point of the spectrum of A and ft > a 
be the nearest larger point of the spectrum of A. Let X denote the complete eigenspace 
corresponding to the eigenvalue a. For a given vector y ^ 0, such that a < p(y) < (3, 
we define x = Pxy, where Px is the orthogonal projector to the subspace X . Assuming 
that x j£ 0, we denote S — span{x,y}. Then 



(5-1) (f3-p(y))(p(y)-a)< l|P ^ ||2 , 

where P$ is the orthogonal projector on the subspace S. 

Proof. The assumption p(y) > a implies y ^ X since a = A m j n (A), so x ^ y. The 
vector x is an eigenvector of As, and p(x) = A m j n (A) = A lmn (As). Let X 1 - denote 
the orthogonal complement to the subspace X. The vector w = y — Pxy £ Sfl X^ 
is an eigenvector corresponding to the other eigenvalue of As, i.e., p(w) = A max (As). 
Since X is the complete eigenspace corresponding to the eigenvalue a = A min (A) , the 
fact that w G X^ guarantees that f3 < p(w). Therefore 

A mi „ (-4s) = a < p(y) < [3 < p(w) = A max (A s ) . 

Substituting (3 < A max (As) for A max (As) in identity (3.3) gives the desired bound. □ 

The Temple bound (2.3) follows from bound (5.1), using ||Psr(j/)|| < 
In Remark 4.1 we have already discussed that having a bound based on \\Psr (y) \\ 
rather than on ||r (y) \\ may give great advantages. 

Of course, Lemma 5.1 can be easily reformulated for the case of the opposite side 
of the spectrum of the operator A. Returning our attention to our main goal, we can 
construct simple examples showing that bound (5.1) docs not hold if neither a, nor /3 
corresponds to the extreme points of the spectrum of the operator A. The extension of 
bound (5.1) to the general case is described below. We start with a technical lemma. 

Lemma 5.2. Let p(y) ^ S(A) for a given vector y ^= 0. Let us denote by U the 
invariant subspace of A associated to all of the spectrum of A larger than p(y), and 
define V = U + span{y}. Then Yi(Av) = p(v) U E (Ajj) , and the smallest point of the 
spectrum E m ; n (Ay) < p(y) is an isolated point of the spectrum E (Ay) — an eigenvalue 
of multiplicity one. 

Proof. Since V = U + spanjy}, we have dim (V (1 U^) = 1. We define a nonzero 
vector v = (I-Pjj)y E V(1U ± , and notice that V = U ©span{?;} is an orthogonal sum. 
Since U is an A-invariant subspace, it is also Ay-invariant. Then the decomposition 
V = U © span{w} implies that v is an eigenvector of operator Ay, corresponding 
the eigenvalue p(v). Moreover, by the variational principle for the Rayleigh quotient, 
p(v) < p(y) < E min (Ay), according to the definition of U. Thus, we deduce that 
E (Ay) = p(v)UTi (Ajj), where p(v) = E m j n (Ay) is an isolated point. The multiplicity 
of the eigenvalue p(v) is one, because, again, dim (V fl U^) = 1.0 

We are now prepared to prove our main result in this section. 

Theorem 5.3. Let a < p(y) < f3 where a and j3 are the nearest points of the 
spectrum of A to p(y). Let U be the invariant subspace of A associated to all of the 
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spectrum of A larger then p{y). We define x = (I — Pu)y and S = span{x,y}. Then 

llp gr .(..M|2 



(5-2) ([3 - p(y)) (p(y) - a) < 

IMP 

Proof. Applying Lemma 5.2 and noticing that S m ; n (Ajj) = (3, we observe that 
the assumptions of Lemma 5.1 arc satisfied if Ay replaces A, where the scalar (3 and 
the eigenvalue £ m ; n (Ay) are the nearest points of the spectrum of the operator Ay 
to p(y,Ay) = p(y). Thus, by Lemma 5.1, 

(R mu^I V (A " <r \\ P sr(y,A v )f \\Psr(y)\\ 2 

p - p(y)) (p{y) - s mi „ {Ay)) < 1— = — ^-^ — . 

IMP IMP 

Since £ m i n (Ay) < a < p(y), we obtain (5.2). □ 

Remark 5.4. Substituting A with —A in Lemmas 5.1 and 5.2, and using similar 
arguments, we end up with the same subspace S = span{P[/y, y} = span{(/ — Pu)y, y} 
and obtain exactly the same bound. 

Repeating the arguments of section 2 with appropriate modifications, wc obtain 
improved versions of the Kato- Temple inequality and other bounds, reviewed in sec- 
tion 2, from Theorem 5.3. Specifically, let the only point of the spectrum in the 
interval (a, 6) be an eigenvalue A, and let also p(y) £ (a,b). Defining the subspace S 
as in Theorem 5.3, we get 

1 \\Psr(y)\\ 2 < _ A ^ 1 \\Psr(y)\\\ 

p(y) - a IMP ~ ~ b - p(y) IMP 

Taking a = p(y) - 5 and b = p(y) + S, where 6 = min ne ^(A)/\}\v - p{y)\, implies 

which, in its turn, gives us the improved Krylov-Wcinstcin bound, 

lx ,.,m ^ \\Psr(y)\\ 
A- p{y)\ < — jpii — . 

\\y\\ 

Remark 5.5. We note that ||Psr(y)|| < \\P v r(y)\\ < \\r(y)\\, since S C V C U. 
Thus, first, the classical a posteriori bounds now trivially follows from our bounds. 
Second, if the smallest quantity \\Psr(y)\\ may not be readily accessible in a practical 
situation, it can be bounded above by the following much simpler expression, \\Pyr(y)\\, 
which can still be dramatically smaller compared to the standard value \\r(y)\\ used in 
Temple bound (2.3). As an alternative for V , we can take U 1 - + span{y} D S. 

6. Improving known error bounds for eigenvectors. The main topic of this 
work is RQ bounds. However, having the machinery of identities already constructed 
in section 3, we need only very little extra effort to revisit and improve some well 
known error bounds for eigenvectors, as a bonus. The reader might have noticed a 
couple of identities, (3.4) and (3.11), that have not yet been used. For convenience, we 
repeat them here, changing the notation in (3.4), for the case where S = span{a;,y}, 
the vector x is an eigenvector of A (and, thus, As) with the eigenvalue A = p(x), the 
scalar n ^ A denotes the other eigenvalue of As, and 9 — Z {x, y}: 



(6.1) 



RQ perturbation bounds 
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and, if 6 < ir/2, 

1 \\Psr(y)\\ 



(6.2) tan (6) 



\v-p(y)\ \\y\\ 



In order to use identities (6.1) and (6.2), one needs to bound the scalar rj. It is 
easily possible in the special case, where A = A m ; n (A) or A = A max [A). The latter 
choice is reduced to the former one by substituting A with —A. We can use the 
arguments from the proof of Lemma 5.1 to handle the case A = A m i n (A). Let the 
assumptions of Lemma 5.1 be satisfied, i.e., let a = A m j n (A) be an isolated point of 
the spectrum of A and (3 > a be the nearest larger point of the spectrum of A. Let X 
denote the complete eigenspace corresponding to the eigenvalue a. For a given vector 
y 7^ 0, such that a < p(y) < (3, we define x = Pxy, where Px is the orthogonal 
projector to the subspace X: and we denote S = spanja;, y}. Then A = p(x) = a and 
13 < rj, directly by the proof of Lemma 5.1. Substituting f3 < 77 in (6.1) and (6.2), we 
obtain the following improvements of the corresponding bounds from [1, 8]: 



sin (20) < 2 



and, if < tt/2, 



/3- A 



tan(*)< 1 



P - p(v) \\v\\ 



Conclusion. Not only have we found a new unifying proof technique, based 
on RQ identities. We have also made a significant improvement, adding an extra 
projector in front of the residual in the bounds that involve the norm of the residual. 
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